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■ Abstract 

We consider one-round games between a classical referee and two players. One of the main 
questions in this area is the parallel repetition question: Is there a way to decrease the maximum 
winning probability of a game without increasing the number of rounds or the number of 
players? Classically, efforts to resolve this question, open for many years, have culminated in 
Raz's celebrated parallel repetition theorem on one hand, and in efficient product testers for 
PCPs on the other. 

In the case where players share entanglement, the only previously known results are for 
^ . special cases of games, and are based on techniques that seem inherently limited. Here we 

' show for the first time that the maximum success probability of entangled games can be re- 

duced through parallel repetition, provided it was not initially 1. Our proof is inspired by a 
seminal result of Feige and Kilian in the context of classical two-prover one-round interactive 
proofs. One of the main components in our proof is an orthogonalization lemma for operators, 
(N ■ which might be of independent interest. 

OO 
(N 

^ ■ Two-player games play a major role both in theoretical computer science, v\^here they have led to 

many breakthroughs such as the discovery of tight inapproximability results for some constraint 
satisfaction problems, and in quantum physics, where they first arose in the context of Bell in- 
equalities. In such games, a referee (or verifier) chooses a pair of questions from some distribution 
and sends one question to each of two non-communicating players (or provers), who then respond 
^ I with answers taken from some finite set. The referee, based on the questions and answers, decides 

• whether to accept (i.e., whether the players win). The main question of interest is the following: 

given the referee's behavior as specified by the game, what is the maximum winning probability 
achievable by the players? Somewhat surprisingly, the answer to this question turns out to depend 
on whether we force the players to behave classically, or allow them to use quantum mechanics. 
In the former case, the players' answers are simply deterministic functions of their input^E and 
the maximum probability of winning is known as the (classical) value of the game. In the latter case 



1 Introduction 
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the players, though still not allowed to communicate, may share an arbitrary entangled state and 
each perform arbitrary measurements on their share of the state. The maximum winning prob- 
ability in this case is known as the entangled value of the game. This model of entangled players 
(also known as that of non-local games) dates back at least to the work of Tsirelson, and it has been 
intensely studied in recent years; yet many questions about it are still wide open. 

One of the most important and interesting questions in this context is the parallel repetition 
question. It is well known that one can reduce both the value and the entangled value of a game 
by repeating it sequentially, or alternatively, by repeating it in parallel with several independent 
pairs of players. However, for many applications (like hardness of approximation results or ampli- 
fications preserving zero-knowledge) we need a way to decrease the winning probability without 
increasing the number of rounds or the number of players, i.e., while staying in the model of two- 
player one-round games. Parallel repetition is designed to do just that: in its most basic form, in 
the ^-parallel repeated game, the referee simply chooses £ pairs of questions independently and 
sends to each player his corresponding ^-tuple of questions. Each player then replies with an £- 
tuple of answers, which are accepted if and only if each of the £ answer pairs would have been 
accepted in the original game. 

Clearly the value of an i'-parallel repeated game is at least the £-th power of the value of the 
original game, since the players can just answer each of the £ questions independently as in the 
original protocol. However, contrary to what intuition might suggest and to the case of sequential 
repetition, parallel repetition does not necessarily decrease the value of a game in a straightfor- 
ward exponential mannei@. The parallel repetition question is that of finding upper bounds on the 
value of a repeated game, and for a long time no such upper bound, even very weak, could be 
proved. First results date to Verbitsky IIVer94l who showed that indeed the value goes to zero with 
the number of repetitions. Following this, Feige and Kilian IFKOOI showed that the value decreases 
polynomially with the number of repetitions for the special case of so-called projection games (in 
which the second player's answer is uniquely determined by the first player's). They used a mod- 
ified parallel repetition procedure in which a large fraction of the repetitions are made of dummy 
rounds, that is, rounds in which the questions are chosen independently at random for both play- 
ers, and in which any answer is accepted. In this paper we deviate somewhat from the common 
terminology, and use the term "parallel repetition" even when referring to such more general pro- 
cedures. Finally, in a breakthrough result, Raz IIRaz98ll showed that the value of a game repeated 
in parallel indeed decreases exponentially with the number of repetitions (albeit not exactly at 
the same rate as sequential repetition). There is still very active research in this area, mostly on 
simplifying the analysis, which, over a decade later, remains quite involved, and improving it for 
certain special cases of games IIHol07llRi3MlFKO07llRii08llBHH+08[lBI^+09[lAKK+MlMT0ll . 



1.1 Previous work 

In this paper we focus on parallel repetition of games with entangled players. The only two previous 
results in this area are for two special classes of games. First, Cleve et al. showed that for the 
class of XOR games (i.e., games with binary answers in which the referee's decision is based solely 
on the XOR of the two answers), perfect parallel repetition holds IICSUU08I . This means that the 

^See |Fei91| for a classical example, and ICSUU08I for an example using entangled players due to Watrous. See 
also IKRIOI for another example where parallel repetition does not reduce the value of a game at the exact rate one 
would expect if the players were playing independently. 
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entangled value of an ^-parallel repeated game is exactly the i-th power of the entangled value 
of the original game. Parallel repetition has also been shown to hold for the more general (but 
still quite restricted) class of unique games IIKRT08II (i.e., games where the referee applies some 
permutation to the answers of the second player and accepts if and only they match those from 
the first player). One might also add a third result by Holenstein IIHol07|l . who proved a parallel 
repetition theorem for the so-called no-signaling value; since the no-signaling value is an upper 
bound on the entangled value, this can sometimes be used to upper bound the entangled value 
of repeated games. However, there is in general no guarantee regarding the quality of this upper 
bound, and in many cases (e.g., all unique games) the no-signaling value is always 1, making it 
useless as an upper bound on the entangled value. 

It is important to note that in these results the entangled value of the parallel repeated game 
is never analyzed directly; instead, one uses a "proxy" such as a semidefinite program IICSUU081 
IKRT08II or the no-signaling value IIHol07L whose behavior under parallel repetition is well un- 
derstood. Moreover, in all these cases, the proxy's value is efficiently computable. This unfortu- 
nately gives a very strong indication that such techniques cannot be extended to deal with general 
games. Indeed, it is known that it is NP-hard to tell if the entangled value of a given game is 1 



or not IKKM''^08[|IKM09I : hence, unless P=NP, for any efficiently computable upper bound on the 



entangled value, there are necessarily games whose entangled value is strictly less than 1 yet for 
which that upper bound is 1 (and such games can often be exhibited explicitly without relying on 
P^^NP). We note that some of the early parallel repetition results for the classical value liFL92 1 fol- 
lowed the same route (of upper bounding the value by a semidefinite program) and were limited 
to special classes of games for the exact same reason. 

To summarize, no parallel repetition result (not even one with very slow decay) is known for 
the entangled value of general games, and, moreover, the known techniques are unlikely to extend 
to this case. Hence the natural question: 

Can parallel repetition decrease the entangled value of games? If so, can we bound the rate of 
decrease? 

In parallel to work on the parallel repetition problem, the related question of product testing 
arose in the context of error amplification for PCPs IIDR061 iDGOSl |Imp08} IIKW09II . Roughly speak- 



ing, the question here is to design tests by which a referee can check that the players play according 
to a product strategy, i.e., answer each question independently of the other questions (as one would 
expect from an honest behavior). Note that if the players are constrained to follow a product strat- 
egy, then their maximum winning probability must necessarily go down exponentially, hence the 
connection to the parallel repetition question. The result of Feige and Kilian l|FKOO|l mentioned 
above in fact also shows that the strategy of the players must have some product structure, and 
recently there has been lots of renewed interest in this question leading to much stronger product 
testers IDMIOL In the case of entangled players, however, absolutely nothing was known: 

Is there a way to test if the strategy of entangled players is in some sense close to a product 
strategy? 

1.2 Our results 

In this work we answer both questions in the affirmative, and our main result can be informally 
stated as follows. 
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Theorem 1 (informal). For any s < 1, 3 > 0, and entangled game G, there is a corresponding i-parallel 
repeated game G', where I = poly((l — s)^^, 5^^), such that if the value ofG is less than s then the value 
of G' is at most 5, whereas if the value of G is 1 then this also hold^for the repeated game. 

The dependency of £ on ^5 in our theorem is polynomial, whereas as we already mentioned it 
is known that in some cases this dependence can be made poly-logarithmic (and this is certainly 
the case if the players are assumed to play independently). While a poly-logarithmic dependence 
is important in some applications for which one would like to perform amplification up to an 
exponentially small value, in many cases the main use of parallel repetition is to amplify a small 
"gap" between value 1 and value 1 — 1/ poly{\G\) to a constant gap, say between 1 and 1/2. In 
this case the polynomial dependence of £ on (1 — s)^^ that we obtain is optimal (up to the exact 
value of the exponent). 

In the course of the proof of this theorem we also establish that the player's strategies have 
a certain "serial" or "product" structure (more on this in the proof ideas and techniques section 
below). The informal statement above hides some details, which we now discuss. The kind of par- 
allel repetition we perform depends on the structure of the game G, and we distinguish whether 
it is a projection game or not. 



Repetition for projection games. If G is a projection game, then the repeated game is obtained 
by independently playing the original G on a subset of the repetitions, and playing dummy rounds 
in the other repetitions. We note that projection games form a wide class of games that captures 
most of the games one typically encounters in the classical literature (see IIRaoOSII ). 

If, in addition, the game happens to be a free game (i.e., a game in which the referee's distribu- 
tion on question pairs is a product distribution), then the dummy questions are no longer needed 
and hence our analysis applies to the standard ^-fold repetition. 



Repetition for general games. If the game G does not have the projection property, then it is 
necessary to add a number of consistency rounds to the repetition. In those rounds the referee sends 
identical questions to the players, and expects identical answers. As before, the other rounds of 
the repetition are either the game G or dummy rounds. The consistency questions are added to 
play the role of the projection constraints. 

This kind of repetition raises the following issu^: namely, it is not obvious that honest entan- 
gled players can answer the consistency questions correctly. This implies that, even if the original 
game had value 1, players might not be able to succeed in the consistency questions and hence the 
value of the repeated game might not equal 1 anymore. This may or may not be an issue depend- 
ing on where the original game comes from. In many cases it is known that, if there is a perfect 
strategy, it does not require any entanglement at all, or it can be achieved using the maximally 
entangled state. In both cases it is not hard to see that players will be able to answer consistency 
questions perfectly, and hence our result holds. Because of this we regard this issue as a minor 
one; however it might be important in some contexts. 



^See the discussion following the theorem for some caveats. 

*This is why we treat the projection case separately, despite it leading to similar decay. 
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1.3 Proof idea and techniques 

We focus on the case of projection games, as the proof of the other cases does not present additional 
challenges. The starting point of our proof is the work of Feige and Kilian IIFKOO|l . for which the 
following intuition can be giverH. Our goal as the referee is to force the players to use a product 
strategy, preventing any elaborate cheating strategies. In other words, we want to make sure that 
the player chooses his answer to the ith question based only on that question and not on any of 
the other £ — 1 questions. Towards this end, the referee chooses a certain (typically large) fraction 
of the £ question pairs to be independently distributed dummy questions, the answers to which are 
ignored. These dummy questions are meant to confuse the players: if they were indeed trying 
to carefully choose their answer to a certain question by looking at many other questions, now 
most of these other questions will be completely random and uncorrelated with the other player's 
questions, so that such a strategy cannot possibly be helpful. 

In more detail, Feige and Kilian prove the following dichotomy theorem on the structure of 
single-player repeated strategies (that is, maps from ^-tuples of questions to ^-tuples of answers): 
either the strategy looks rather random (in which case the players cannot win the game with good 
probability — this is where the projection property is used) or it is almost a serial or product strat- 
egy, i.e., the answer to each question is chosen based on that question only (in which case the 
player is playing the rounds independently, and his success probability will suffer accordingly). 

Our proof follows a similar structure. However, an important challenge immediately surfaces: 
the proof in IIFKOOL and indeed all proofs of parallel repetition theorems or direct product tests, 
make the important initial step of assuming that the player's strategies are deterministic (which 
is easily seen to hold without loss of generality). And indeed, it is not at all trivial to extend those 
proofs to even the randomized setting without making this initial simplifying assumption. To 
give a simple example, an important notion in Feige and Kilian's proof is that of a dead question — 
simply put, a question to which the player does not give any majority answer, when one goes over 
all possible ways of completing that specific question into a tuple of questions for the repeated 
game. It is easily seen that, in the case of a deterministic strategy, dead questions are harmful, as 
the players are unlikely to satisfy the projection property on them. However, it is just as easily 
seen that for most randomized strategies, good or bad, all questions are dead. 

This illustrates the kinds of difficulties that one encounters while trying to show parallel rep- 
etition in the case of entangled players, when one cannot simply "fix the randomness". The issue 
we just raised is not too hard to solve, and others are more challenging. Indeed the main difficulty 
is to define a proper notion of almost serial for operators, which would in particular incorporate 
the inherent randomness of quantum strategies. It turns our that the right notion is the notion 
of consecutive measurements (rather than tensor products of measurements for each question, a 
tempting but excessively strong possibility). Based on a quantum analogue of Feige and Kilian's 
dichotomy theorem, we are able to show that the almost serial condition induces a condition of 
almost orthogonality on the player's operators. At this point we need to prove a genuinely quan- 
tum lemma, which lets us extract a product strategy from the almost-orthogonal condition. This 
novel orthogonalization lemma is at the heart of our proof. We obtain that the players approximately 
perform a series of consecutive measurements, each depending only on the current question. An 
upper bound on the value of the repeated game then follows. 

^ We refer to Ryan O'Donnell's excellent lecture notes IO'D05bllO'D05al for a helpful exposition of Feige and Kilian's 
proof. 
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Organization of the paper. We start with a few definitions, including a description of the form 
of the repeated games that we consider, in Section |2l We then give a high-level overview of the 
structure of the proof, and the main ideas governing it, in Section |3l Section H] contains the proof 
of our main theorem. Finally, Section |5] contains the proof of an important technical component 
of our proof: an approximate joint block-diagonalization of positive matrices which are close to 
being orthogonal. Appendix|A]contains a few additional useful technical facts. 

2 Preliminaries 

2.1 Games 

In this paper we study two-player one-round games. Let Q and A be finite sets. An entangled 
game (or simply game) can be defined as follows. 

Definition 2. An entangled game G = {V,ti) is given by a function V: A^ x — > {0,1} and a 
distribution n: —> [0,1]. The referee samples questions {q',q) according to n, and sends q' to the first 
player and q to the second player. He receives hack answers a', a respectively. He accepts those answers if 
and only ifV{a', a | q', q) = 1. The value of the game is 

cv*{G)= sup X] E ^('?'''?)^(«''«k'''?)Cf|^J ^^^^l^f) 

|Y),A,,B, {q',q)GQ2 {a',a)<^A^ 

where the supremum is taken over all finite-dimensional Hilbert spaces T-L, all a priori shared states | Y) G T-L 
and all Projective Operator-Valued Measurements (POVMsEa^, = {A^',} a'(zA and = {Bg}a£^ on T-L. 

We note that by standard purification techniques (see IICHTW04II ) one can assume that for each 
question q each player performs a projective measurement with outcomes in A (i.e., YlaGA — 
and (A«)+ = Al = (A«)2). 

We will be interested in some special classes of games. 

Definition 3. A game = {V,n) is called a 

• Projection game if for every q',q E Q and a' G A, there is a unique a E A such that V{a',a\q', q) = 
1. 

• Free game if re = tta x ttb is a product distribution. 

• Symmetric game if n is symmetric, and for any q' ,q, a' , a we have V {a' ,a\q' ,q) = V{a,a'\q,q'). 

2.2 Repeated games 

We consider two different types of repeated games. The first one, originally used by Feige and 
Kilian, applies to projection games, and we describe it in Definition HI The second type of repeti- 
tion applies to consistency games, and is closer to the direct product testing technique originally 
introduced by Dinur and Reingold IIDR06II : we explain it in Definition|5l 

'^The POVM condition states that each A"^, > 0, and J^^, A"^, = Id. 
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Definition 4 (Feige-Kilian repetition). Let i be any integer, and define Ci := and C2 '■= i — Q. 

Given a two-player projection game G = {n,V ,Q, A), its l-th Feige-Kilian repetition is the following game 

• The referee picks a random partition [£] = M U F, where \M\ = Ci and \F\ = C2 = I — Ci. Indices 
in M will be called "game" indices, while indices in F will he called "confuse" indices. 

• The referee picks {q'j^, qu) ^ {Q >^ Q)*^^ ■ 

• Hepicks {q'p,qF) ~(7T^x7rfj)'^2 (Q x Q)'~^, where rcj^ is the marginal of tt on the first player, and tzb 
the marginal on the second player. 

• The referee sends the questions to the players (without specifying which questions are of which type). 
On game questions he verifies that the original game constraint is satisfied. He accepts any answers 
to confuse questions. 

Definition 5 (Dinur-Reingold repetition). Let i be any integer, and define C[ := £^^^, Ci = 2C[ and 
C2 '■= i — Ci. Given a two-player symmetric game G = {n,V ,Q, A), its i-th Dinur-Reingold repetition 
is the following game Gjj^gy 

• The referee picks a random partition [I] = R U G U f , where \R\ = C[, |G| = C[, and \F\ = C2. 
Indices in R will be called "consistency" indices, those in G will be called "game" indices, and those 
in F "confuse" indices. 

• The referee picks C[ questions q^ ~ c' Q*"^ ^nd sets q'^ = qj^, where Ua is the marginal of n on the 

first player (since we assumed G was symmetric, this is the same as ttb, the marginal on the second 
player). 

• The referee picks C[ pairs of questions {q'Q,qG) ~ c; (Q x Q)^'^- 

• Hepicks {q'j:,qf) -^i^^^^^^f^ (Q x Q)^\ 

• The referee sends the questions to the players (without specifying which questions are of which type). 
On consistency questions he verifies that both answers, from Alice and from Bob, are identical. On 
game questions he verifies that the original game constraint is satisfied. He accepts any answers to 
confuse questions. 

Note that, if a game G has value 1, then its Dinur-Reingold repetition does not necessarily also 
have value 1, as the player's optimal strategy in G might not be consistent. A consistent strategy 
is one in which whenever the players are asked the same question they provide the same answer 
with certainty. This may not always hold of an optimal strategy; nevertheless the following lemma 
shows that we can assume it holds in some natural settings. 

Lemma 6 (Lemmas 3 and 4 in llKKM+08ll ). Let G = {V,n) be an arbitrary 2-player entangled game. 
Then there exists a game G' = {V , n') of the same classical and quantum values with twice as many 
questions, and such that n' and V are symmetric under permutation of the variables. Moreover, given 
any strategy Pi,...,Pn with entangled state |Y) that wins G with probability p, there exists a strategy 
P[, . . .,P'j^ with entangled state that wins G' with probability p and is such that P{ = ■ ■ ■ = Pj^ 
|Y') is symmetric with respect to the provers 1, . . . ,k. In addition, if |Y) zvas a maximally entangled state 
then |Y') is also. 
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This lemma shows that, if G is any game, then we may symmetrize it and assume that the 
provers are also playing according to a symmetric strategy. In particular, if G had value 1, and the 
optimal strategy used either no entanglement or a maximally entangled state, then this also holds 
of the optimal strategy in the symmetrized game. Such a strategy is automatically consistent. 

3 Proof overview 

We first give a formal account of our results in the next section, before proceeding to give an 
overview of their proof in Section l3!2l 

3.1 Results 

We first state our main theorems. They refer to the two types of repetition of an entangled game 
G defined in the previous section, its £-th Feige-Kilian repetition Gpj^(^(y and its £-th Dinur-Reingold 
repetition G^j^i^gy Both types of repeated games are made of £ independent rounds, played in 
parallel. Some of these rounds consist of independent repetitions of G, while others are either 
confuse or consistency rounds, containing simple tests independent of the original game (except 
for the distribution with which questions are chosen in those rounds). Our first result pertains to 
projection games. 

Theorem 7. There exists a constant c > 1 such that, for all s < 1 and S > there is a i = 0{{S^^ (1 — 
s)^^y) such that, if G is a projection game with value u)*{G) < s, then the entangled value of the game 
Gfk(C) ai most 5. Moreover, if the value of G is 1 then the value ofGp^^-j is also 1. 

In the case of free projection games, questions to the players are chosen independently, so that 
the distribution on questions in the confuse rounds of the game Gp^f) is exactly the same as that 
in the original game. The only difference is that in such a round, all answers are accepted, which 
can only help the players. Hence the direct parallel repetition of G has a smaller value than its 
Feige-Kilian repetition, which implies the following. 

Corollary 8. Let s < 1 and S > 0. Then there is a £ = 0{{S^^ (1 — s)^^y) such that, if G is a free 
projection game such that a;*(G) < s, then the (direct) £-fold parallel repetition ofG has value at most 5. 

Our second result is more general, as it applies to arbitrary games. It only comes with the mild 
caveat that, in order to preserve the fact that the original game had value 1 (whenever this indeed 
holds), it is required that in that case there also exists a perfect strategy which is consistent. 

Theorem 9. There exists a constant c > 1 such that, for all s < 1 and S > there is a £ = 0{{S^^ (1 — 
s)^^y) such that, if G is an arbitrary game with value co*{G) < s , then the entangled value of the game 
Gdr(£) at most 5. Moreover, ifG has a perfect consistent strategy then the value ofG^^^ is also 1. 

Lemma [6] shows that the requirement that G has a perfect consistent strategy (which is only 
a requirement in cases where we are interested in preserving the fact that G might have value 1) 
is satisfied for many examples of games, including those for which we know a priori that, if the 
value of G is 1, then there is an optimal strategy that either does not use any entanglement at all, 
or uses the maximally entangled state. 
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3.2 Proof overview 



In the remainder of this section we describe the main ideas behind the proof of Theorem [7| and 
Theorem |9l full details can be found in Sections |4] and |5l Our goal is to understand repeated quan- 
tum strategies, that is, maps ci ^ ^ }a£A' which map tuples of questions q = {qi, . . . ,q() 
to projective measurements { ja^^/ in dimension d. The semantics are that, on receiving ques- 
tions q, a player measures his share of the entangled state | Y) according to { IneA'' resulting in 
him sending back answer a with probability (Y| Id X^ | Y) . Interestingly, most of the proof will be 
directly concerned with the measurements { XJJ }„g^f themselves (together with the reduced den- 
sity p = Try\ |Y) (Y|), without reference to the other player's measurements or even the underlying 
game. 

We will be interested in a strategy's marginals: given a fixed subset of indices S C [£] and a set 
of questions qs on the indices in S, one can define the marginalized measurement 



aGAn\S 



which corresponds to choosing a tuple q E QI'^I^^ by picking the question in each coordinate 
independently according to some fixed distribution 7r|3 making the measurement corresponding 
to the POVM described by { Xg^JJ }{as,a)GA^' ^^i^ marginalizing over those answers a corresponding 
to indices not in S. 

Given that X was a projective measurement, the marginalized strategy is a POVM — it is not 
necessarily projective any more. Our main results will pertain to the structure of such marginal- 
ized strategies. We will show that they are either very random (this is formally called dead later 
on, and morally means that the marginalized strategy is very far from a projective measurement; 
rather its singular values tend to be small and spread out), or highly structured (this is called se- 
rial later on, and after some work we will show that it implies that the marginalized strategy has 
somewhat of a product form, i.e. it can be decomposed as a product Ylq\ ■ ■ ■ Ylql on a subset of the 
coordinates). The attentive reader might already see that once this is proven it will be possible 
to bound the success probability of both types of strategies in the repeated game; however we 
should warn that the exact statements, and their proofs, are quite technical and carry only a fair 
share of the intuition we have just given. 

We proceed to give a few more details on the structure of the proof of our results. It can be 
divided into three main steps. The first two steps establish facts about the structure of repeated 
single-player strategies, and are independent of the game being played, as well as of the other 
player's strategy. 



Step 1: A quantum dichotomy theorem. In the first step we prove an analogue of Feige and 
Kilian's dichotomy theorem IIFKOOL The precise statement is given in Lemma [l2l and its simple 
proof very closely follows that of Feige and Kilian's theorem. Informally, it states that there exists 
an integer 1 < r* <^ £, such that a tuple of questions (R, (Jr), where R C [i] denotes a subset of r* 
indices, and qR fixed questions in those positions, can be of two types only. Either it is dead (case 1 
in the lemma), or it is (1 — rj)-serial, where > is a small parameter (case 2 in the lemma). Both 

''We will often drop the reference to n and simply write [■]. n will be fixed throughout, and later instantiated to 
the (marginal) distribution on questions from the original game G that is being repeated. 
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types of strategies are precisely defined in Definition [TTl and the meaning of dead is the easiest to 
grasp. The technical definition is simply that the (marginalized) measurement {^qR}ai^GA'^' when 
performed twice (sequentially) on the same hal^ of the state |Y), is unlikely to produce the same 
result. This kind of strategy is easily seen to be bad for the players, as is shown in step 3. of the 
proof. 

Serial strategies are more subtle. In the case of a classical deterministic player, a serial strategy 
is such that, when one conditions on the player giving answers Ar to the questions in R, the 
answers to most other questions (not in R) are for the most part determined by the player as a 
direct function of the corresponding question, i.e. he is playing an honest product strategy on 
those coordinates. In the quantum case, we will adopt a seemingly weaker definition, which is 
that a strategy is serial on ((?«,««) if, in expectation over the choice of an additional question qj 
in position /, when the marginalized measurement {Xq'^q'i}(^ag, a )ga'^li{!} is performed twice on the 
same half of [Y), the probability that the same answer (ar, fl,) is obtained twice is almost as large 
as the probability that just Ar is obtained twice: conditioned on being consistent on the answers to 
the questions in R, the strategy is also consistent in its answer on a random additional question qj 
in position 

Fleshing out the consequences of this definition to eventually show that it implies something 
close to the classical definition requires some work, and is the object of the second step of the 
proof. 

Step 2: A product theorem for serial strategies. While for a classical deterministic player a 
serial strategy, as defined in the previous section, is one which decides on the answer a, to most 
questions qi not in R as a function of that question alone, in the quantum setting this is much less 
clear. The first task is to decide on what one expects from a serial strategy. For instance, one might 
ask for the measurements to take some "approximately-tensor" form; however we find that this is 
too strong a requirement. Instead, we first show that the serial property implies that the player's 
measurement operator {X^r!j| }(„j^ has a certain block-diagonal form, in the sense 

where {njj| }„ are orthogonal projectors; the precise statement is given in Claim[l6l Its proof goes 
through a technical statement about sets of operators which are close to being pairwise orthogonal. 
That statement, proven in Lemmal23l shows the natural fact that such operators are close to having 
a common block-diagonalization basis. 

Once this is shown it is not hard to extend the approximation to a small number of additional 
questions qi, . . . , qg, showing that the corresponding measurement also has a block-diagonal form, 
this time described by the product of the corresponding projectors IF^J ■ • ■ nj|'^; a precise statement 
is given in LemmalTTl It is in the precise sense described in that lemma that we can say that a serial 
strategy has a product form, based on which we can think of the player as playing sequentially on 
a subset of the coordinates. 

^In fact we will also need to consider the outcome of performing the same measurement simultaneously on the two 
halves of 

^Note that this "approximation" should be taken with a grain of salt; in particular one cannot expect to extract any 
information about the measurement operators themselves simply by observing statistics of measurement outcomes. 
Rather, all our estimates will bear on the post-measurement state, resulting from applying the measurement corre- 
sponding to X'^'^'^ii to one half of 
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Step 3: Both dead and serial strategies fail the repeated game. In the last step of the proof 
we show that both types of strategies, dead or serial, must fail in the repeated game with high 
probability (provided the value of the original game was bounded away from 1). For the case of 
dead strategies this is fairly intuitive: since a dead strategy does not assign consistent answers 
to a certain subset of the questions qR, this implies that the player's answers in positions R will 
very much depend on the questions present in those indices not in R; not only that but it will 
be virtually impossible for the other player to correlate well with this player's answers on those 
indices. Here we crucially use the "projection", or "consistency" rounds of the repeated game in 
order to show that such strategies will fail in those rounds with high probability. This is proven in 
Claim [H 

The case of serial strategies is slightly harder to analyze, but it boils down to showing that 
the block-diagonal form we described earlier roughly implies that we can in fact see one of the 
players as making a sequential measurement governed by the Ylq'.. Since in this case the player's 
answer to question qi is decided by applying a projective measurement depending on qj alone, 
in case the original game had a value s < 1 such a strategy will fail in at least a fraction s/2 of 
the "game" rounds with high probability, and be caught by the referee provided there are enough 
such rounds. This is shown in Claim [l9l 

Finally note that the "confuse" rounds of the repeated game are not used in this stage (and in- 
deed the referee accepts any answers in those rounds), but they are crucial to show the dichotomy 
lemma and the following claims, which only hold for strategies which have been marginalized 
over a sufficiently large number of questions; in order to be able to perform this marginalization 
it is important that questions to the players in the confuse rounds are picked independently. 

4 Proof of the main theorem 

In this section we give the proof our main results. Theorems [7| and |9] It is divided in three parts. 
The first, in Section l4!2l establishes our "quantum dichotomy theorem". The second, in Section l431 
investigates the structure of serial strategies, and shows that they admit a certain block structure. 
The results in this section are based on our "orthogonalization lemma", which is proved separately 
in Section |5l Finally, in the third part. Section I4.4[ we use the results from the first two parts to 
bound the success probability of the players in the repeated game. 

Because of the nature of repeated strategies, which are indexed by large tuples of questions and 
answers, we are constrained to use rather heavy notation. We explain it in detail in the following 
section, which can also serve as a reading guide for the statements that are to follow. 

4.1 Notation 

Recall that for every q G Q^, {X^ }aGA' ^^i arbitrary projective measurement in d dimensions, that 
is, the are projector matrices, and for any fixed q they sum to the identity over a. The position 
of the questions (or answers) in a tuple will always be fixed and usually clear from the context; 
for example when we write q = {qc^qp), where G, F C [£] are sets of indices, it is not necessary 
that the questions qc are placed before the questions qp in the tuple q; rather their position is 
determined by the indices in G, F. When precision is needed we shall write (f, qi) to express the 
fact that question qi is destined to appear in the i-th position of some tuple q. We also write q^, to 
denote tji, . . . , qi+i, ...,qi. 
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We will often consider marginalized POVMs over a certain set S C [£]. Given questions qs 
indexed by S, the marginalized POVM is the POVM indexed by answers as, which results from 
applying {X^jgjaja for a random q E QI^^^^, and ignoring the answers a not in S. More precisely, 
given (S, qs, as) it will be convenient to work with the Stinespring representation 



vasa 



iX) {q,a\ 



where E is an extra register of the appropriate dimension, and n denotes an arbitrary distribution, 
fixed throughout (it will later be instantiated to the marginal distribution that arises from the 
original game G that is being repeated). This definition satisfies, for any p > 0, 



where the identity Id^ was created on the additional register E introduced in the definition of 
and the expectation is with respect to the distribution n. In order to make measurements 



corresponding to marginalization over different sets S, we will assume that the register E is always 
of large enough dimension, and if necessary X^| is tensored with 



, = En „ ((?, fl I on the extra 

2|S| registers. Note that there is nothing in the definitions above that require the questions and 
answers in X^g to be indexed to the same set, hence we extend them to define X^J, for T Q S Q [£], 
in the obvious way. 

For any p > 0, we write Trp{A) for Tr{A{p ( 



Id^)), so that in particular 



^^qsq) 



where we define 



X"^ 
9s 



Terms such as Tr (X^Jp) will frequently appear on the right-hand side of our inequalities, and they 
should simply be considered as normalization factors, accounting for the (possibly unnormalized) 
underlying state p, and the conditioning on a fixed flj. Finally, given p > and a matrix A of 
appropriate dimension, we introduce the semi-norm 



\A\ 



Tr{Ap 



i/2^y/2) 



(1) 



Note that || • |]p is definite only if p has full rank. We will mostly use this norm for notational 
convenience. At this point it suffices to observe that it derives from a semi irmer-product, so that 
it satisfies the Cauchy-Schwarz inequality. 

At a first reading it may be helpful for the reader to consider the special case of the totally 
mixed state p = d^^Id; putting the notation in context this corresponds to the players sharing the 
maximally entangled state. In this case very little of the above is really needed, and in particular 
Trp((XqJ)+X^j) is simply the normalized trace d-^Tr{X"^l^)]. Many of our statements are 

easier to prove, and to understand, in this setting (the main cause of simplification being the 
commutation between p and the X operators), so that the reader may wish to consider it first. 
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4.2 A quantum dichotomy theorem 

In this section we prove two important lemmas on the structure of any quantum strategy in a 
repeated game. The main lemma. Lemma [121 is the analogue of Lemma 11 in IIFKOO|l . It establishes 
a dichotomy between two different types of strategies that a player can use, showing that either 
the strategy is very random, or it must have a relatively strong sequential structure. Its proof 
follows that of the classical setting without too much added difficulty, provided the definitions 
are made correctly — which we now proceed to do. 

A crucial difficulty in adapting Feige and Kilian's argument is to define an appropriate mea- 
sure of a strategy's unpredictability. In the classical case of a deterministic strategy, this can be 
measured through the entropy of the marginalized distribution on answers; however in the quan- 
tum or even the randomized setting such a measure is no longer helpful, as even honest product 
strategies can be very random, just by being convex combinations of distinct deterministic strate- 
gies. Instead, we measure unpredictability as follows. 

Definition 10. Given a strategy X^, a state p, and a fixed set of questions qR in positions R C [i], define 
the collision probability of X on q^ as 

Pcoiiul^'p) := E^co;('?R/flR|X:,|0) (2) 

where 

Pcoi{qR,aR\X,p) := [Trp{{X"^iy X^^liX'^l)' X'^^^^ (3) 

To understand this definition, first consider the case when p is the totally mixed state d^^Id. In 
this case both terms inside the summation are equal to the normalized squared Frobenius norm 
d-i||X^«||2. Expression © can be interpreted in two different ways. From an operational point 
of view, it corresponds to the probability that one obtains twice the same answers when one se- 
quentially performs a measurement using the POVM with elements {X^^ joj; ■ In this sense, Pcol is a 
measure of the predictability of the strategy X"^: pick two completions q, q' at random and measure 

using first {X^q^qja^a and then using {X^^^'j^j^a/; Pcoi('?r|X|0) is the probability of getting twice the 
same result Ar (and ignoring the other answers a, a'). The analytic interpretation is that this is a 
measure of the entropy of the spectrum of Xj^^, which is maximized when X^J^ is a projector (for a 
fixed value of the trace). 

In case p is not the identity, and hence does not commute with the X'^, we need to adopt 
the more cumbersome definition (|2]| for technical reasons. However, note that the operational 
interpretation remains — the first term on the right-hand side of (|3]) is the probability of obtaining 
the same answer when performing the measurement twice on the same half of while the 
second term is the same, when the measurement is performed on the two different halves of 
indeed, note that Tr(X^^« pi/2) = (y|X«^ ^ (Xq^«)^|Y)0 

The following lets us make the distinction between the two different types of strategies alluded 
to above. 

Definition 11. We will say that: 

^"Note the transpose sign, which indicates that our interpretation is only rigorously correct for the case of real sym- 
metric X. 
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• A block (R, qR) is e-dead ifPcoi{cjR\X,p) < e. If a block is not c-dead it is e-alive. Moreover, we say 
that the answer ar is £-alive if it satisfies 

Pcoi{qR,aR\X,p) > eTr{X;ip) 

Note that any e-alive block has at least one e-alive answer. Sometimes we will simply say that a block 
or an answer are alive or dead, leaving the parameter e implicit. 

• A block {R,qR,aR) is (1 — r])-serial if aR is alive and the following holds: 

^(i,qi)[PcoMR,m\X,p)\ > {1- t])Pcoi{qR\X,p) (4) 

Lemma 12. Assume that e,rj > are chosen such that rj e^ > 16 C^^^^^ Then one of the following holds 

1. At least fl (1 — e) fraction of blocks {R,qR) are e-dead. 

2. At least an e fraction of blocks {R, qR) are e-alive, and moreover if {R, qR) is an e-alive block then 

^ Tr{x;ip) < e/2 (5) 

^^R'^R^ ;s not (1 — )j)serial 

i.e. alive answers which are not (1 — fj)-serial have a small probability of occurring. 

Proof. We extend the definition of the coUision probability to measuring colHsions over answers 
which are not necessarily on the same indices as the questions: 

P,M%X,p) := 5:(rr,((X«'^)^X«'^(X«'^)+X««)+Tr(X^"V''%"V''')) 

where now q can be any subset of fixed questions, and R denotes the subset of answers on which 
we are measuring the collision probability. 

Claim 13. There exists an integer 1 < r* < Ci such that 

Er,cj^ [PcoliqR\R,X,p)]- ER,^^,,^^[P,a{qR,qi\R[j{i},X,p)] < SC^^^^ 
where the expectation is taken over all subsets R of size \R\ = r*. 

Proof. There is a similar statement in IIFKOO|l . Here we closely follow the proof of Corollary 3.2 in 
the lecture notes IIO'DOSb l: since the argument is very similar (mostly replacing the use of Fact 1.3 
in those notes by our Claims |26] and |29l) we only outline it here, leaving the details to the reader. 
The proof goes by considering what happens to the collision probability when one conditions on 
an additional question, resp. one considers collisions over an additional answer. First, note that if 
one extends R by an index i, then PcoK'?!^ U {/},X,|0) < Pcoi{q\R,X,p), since obtaining identical 
answers on R is a necessary condition to obtain identical answers on R U {/}. The following 
equation is the analogue of Fact 1.4 in llO'D05b|l : 

[Pcoi{q,qt\R,x,p)] - p,oMR,x,p)\ < 4Cf^/' (6) 

^^Recall that Cj, C2 are chosen so that Ci + C2 = i: see Definitions |4] and |5] for more details. 
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The proof of (O follows directly from Claims |26] and |29l and we omit it. It shows that the colli- 
sion probability cannot increase by too much when one conditions on an additional question, in 
expectation. The proof of the claim is then concluded exactly as in the classical case: consider a 
sequence of steps in which one successively looks for collisions on an additional coordinate i, and 
conditions on an additional question qi. In expectation over the choice of (/, qi), P^oi will never go 
up by more than 4C^ when one performs this operation. Since Pcoi is always between and 1, 
the fact that it never goes up by much implies that there must be a step in which it doesn't decrease 
by more than 8C^^^^: the total decrease cannot be larger than the total increase plus 1. r* is chosen 
so that this step occurs when r* indices (and questions) have already been fixed. □ 

Towards a contradiction, assume the negation of both 1. and 2. With probability at least e 
a random block (R, (Jr) is alive, and moreover if (R, ^Jr) is alive then alive answers which are 
not (1 — ?/) -serial have a significant contribution. Fix such an answer Ur. Since (H)) is not satisfied, 
summing over all which are alive but not (1 — ;/) -serial one can see that the collision probability, 
for this (R, (Jr), must decrease by at least 

V Pcoi{qR,aR\X,p) 

aR '.iiR alive but 
(^R'^r) ^ i/)-serial 

By the negation of (O and the fact that the answers are alive, this quantity is at least rje^ / 2. Finally, 
taking the expectation over the choice of (R, (?r) gives a total decrease in P^oi of at least t]c^/2, 
contradicting Claim [B] if f/e^ / 2 > 8 C^; ^ ''^ . □ 

4.3 Serial strategies 

The main result of this section is Lemma [17[ which shows that serial strategies have a product 
structure. Given that most of the strategies that we consider in this section will have a fixed 
and flR, we introduce the useful notation Y^l := Xq^q^^ (resp. Y^l := X^^g^^) for any S C [£]\R; the 
value of qR and ar should always be clear from the context. We will also simply write Y for X^^ 
(resp. y for X^^). For the totality of this section > is a fixed parameter, which one can think of 
as polynomial in the soundness 3 that we are aiming for in the repeated game. 

We start with a simple fact which expands on the defining property of (1 — ?/) -serial strategies. 

Fact 14. Let qR G Q^. For every ur G there exists aa^, > Tr(Xq^jo) such that J^^^^ ccaf, < 3 and the 
following holds. Suppose (R, qR, ar) is (1 — rj)-serial, and assume that rj > C^^^"^. Then for a fraction at 
least (1 — rf^^"^) of all [i,qi)for i ^ R we have that 

< Trp(y;y,,yjyq,) -X:Tr,((y;;f y^^^^^^ < ^rf'^^a, (7) 
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Proof. By condition (H)) in the definition of (1 — ?/) -serial, the Y^' satisfy 



Tr{Y p^'^Y p"^) - E Tr(y;; p"^Y'^i p'/^) 



L/2 



< 7(Trp(y^yy+y) +Tr{Yp'^^^Yp^^ 
For any fl^ G A^, let 

max (rr(x;V),r [I^^^p - Tr,((X;^,/x;\.X;\) | 



By applying Claim |29] to the X^^,^ we obtain 



|T^r,((X;«)%"^x;^)-Tr,((X;\0'x;\x;^J| < 2q-^/2Tr(p) 



(9) 



(10) 



which, by using our assumption that C2 < f] and Tr{X'^'^p) < Tr{p), implies J^^'^ a„i^ < 
3Tr(p) < 3. Applying Claim l26l to the Y^ we also obtain 



Tr{Yp'^^Yp'/')-Tr{Y,,p'/%p'/')\ < rj cc 



Or 



Hence (HJ, together with an application of Markov's inequality, implies that, for a fraction at least 



< r]^'^ (^rp{Y^ yy+y) +Tr(y|0^/2y|oi/2) + I1 



■an 



By expanding out the Yq. terms, one can verify that both terms on the left-hand-side of this equa- 
tion are positive, hence each of them must be smaller than the right-hand-side, itself smaller than 
4 rf^'^a.a„ . This proves both and dH). □ 

We now prove a simple claim which shows that (1 — //) -serial strategies are close to being 
orthogonal; this is how we will subsequently exploit that property. 

Claim 15. Let G Q^. For every ar G there exists a„j^ > Tr[X'q^^p) such that Y,af. ^ 3 and 
the following holds. Suppose that {R, qR, ar) is (1 — r])-serial. Then for a fraction at least (1 — Vj^^'^) of all 
{i,qi) fori i R, 

(11) 



where pa, = p^''^Y^'.p^^'^. 
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Proof. Define as in GO]). Letting Z,- = Y+ (Y^T^ )y,,. - E«, (t;;)^Y,^* (Y;;)+ Y^;, Eq. © from FactHl 
can be re-written (for the [i, qi) for which it holds) as 

Trp{Zi) < 4j/3/4a„^ 
Let Pi := Pa,, where = p^^^Y^;p^^^. Since < p and Z, > 0, we get 

rrp,(ZO < Trp{Zi) < ^^'^aa^ 

and hence, expanding out Z,, 

E < (12) 

Finally, we can use (O to upper-bound 

where we used Za' ^qii^liy ^ Together with this proves the claim. □ 

Claim 16. Let G Q^. for every ar G f/iere exists oiaf, > Tr(^X'^'^p) such that J^^r ^ 3 and 
the following holds. Suppose that (R,(jR,flR) is (1 — r])-serial, let 1 < g < Ci/2 be a fixed parameter, and 
{G,qG) chosen at random under the constraint that G n R = and \G\ = g. Then with probability at 
least (1 — j/^''^ — e^^^) over the choice of {G,qG), there is a partition G = G' U G" , where g" = \G"\ > 
(1 — 4?/'^/^) g, such that for every i G G" 

X:Tr,,((Y;^)+(W-n«,;.)Y;^) < 0{g r^'/^^) (13) 

where for i G G", {n5|j}„, is an orthogonal measurement depending only on q^, ar and qi (it is independent 
of the particular choice of (G, qc)), pG = P^^^^qcP^^^' ^^'^ c > 0, C2 > 1 are universal constants. 

Proof. Since {qR,a^) is (1 — ?/)-serial, we can apply Claim [151 to obtain that a fraction (1 — tj^^'^) of 
{i,qi) satisfy 

E ^..((^ (^^)'^^) < (14) 

where as before pa^ = p^^'^Y^Ip^^^. We can now apply Lemma |23l to the Y^'. (with the states p^,) to 
obtain, for the fraction (1 — rj^^'^) of (/, qt) considered above, orthogonal projectors {ngi ln, satisfy- 
ing 

Er''p.((^;;)'(^^-n?:)^;;) < oiv'^'^xJETripa,))''' ds) 

fl; (7; 

Moreover, the Ylq'. can easily be made into a projective measurement by enlarging one of them, 
so that they sum to identity; this will not harm the above bound. By Markov's inequality, with 
probability at least (1 — rj'^^'^) over the choice of (/,(?,) it holds that Tr(Y<,,p) < t]-'^'^Tr{Yp) < 
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rj^'^^'^a.af,. For any given {G,qG), let G" C G denote those indices i in G for which this property 
holds for {i,qi), and moreover {i,c]j) falls in the set of indices for which (|15)| holds. By the union 
bound and a Chernoff bound, the probability that |G"| < (1 — 4^'^''^)^ is less than e^^S^ and for 
ever i G G" we have 



(16) 



for some constant C2 > 0. Applying Claim |26] to the Yq'., and summing over fl;, we find that in 
expectation 



'-iG,qG) 



where we used < rj, pca, '■= P^^'^^qcP^^^' ^^id we think of the choice of {G, qc) as first picking 
{i, qi) and then the remaining positions and questions. Another application of Claim l26l combined 
with (|8]l shows that for every i G G", 



(G,<?g) 



Hence, letting pQ := p^^^Yi^^p^^^ = ^„ Pcaif combining the two previous equations we get 



3/4^ 



Using Markov's inequality, his lets us replace Y^'. by Y^'^ in ((T5)l for a fraction (1 — /y^^^) of (G, (Jg)/ 
losing an additional factor 0{gri^^^)iXaj^. Hence 



(17) 



where we safely assumed that C2 > 2. 



□ 



Lemma 17. Lef G Q^. For every G f/jere exfsis a^j^ > Tr(X^^|o) such that Y,^^ otaj^ < 3 and the 
following holds. Under the same conditions as in Claim^6\ except for a lower fraction (1 — 2t]^^^'^^ — e^^^) 
of {G,qc), it holds that 



E Tr,, ((y;f rt;f' - (t;f )^nv . . . h- . . . Hj;;y;-') < o(^^i/(«-)) 



(18) 



(19) 



Proof. Let {H^j} be the orthogonal projectors promised by Claim [161 Let g" = \G"\, and assume 
for simplicity that the first g" questions in G are those in G". To prove the first inequality, we 
show the following by induction on f = 1, . . . ,g": there exists a constant C > such that, if we let 
Fj = {1,. . .,/}, then 



^lR 



(20) 
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The statement for i = g" will imply ((18)) . Let Co be the constant implicit in ((13)1 from Claim [16) 
For i = 1, ((20)l is simply a re-statement of ((13)) , provided C is chosen larger than Cq. Assume the 
inequality verified for i — 1, and prove it for /. Write 

The first term on the right-hand side (when plugged back into ((20)) ) can be bounded directly us- 
ing ((13)) (and the fact that the projectors n^'' sum to identity over Uj, for S — 1}). Regard- 



ing the second, we can use the Cauchy-Schwarz inequality together with (|13)| to bound 



1/2 



By Markov's inequality, Tr(Yq^p) < r]^^^'^'^^Tr{Yp) for a fraction at least (1 — tj^^^'^^) of (G, (^g)/ so 
that for those indices the bound above can be replaced by 2\/Co^r}^^^'^'^^'>Dia,^. For the rest of this 
proof we only consider questions (G, qc) for which the bound Tr{Yq^p) < rj^^^'^^'^TriYp) applies. 
We can similarly obtain 



at. 

SO that 

Yjr,, ar/- - Y^)^n;; . . . n;; . . . n;;(y;^ - y^)) 



< E ^rpo{{%'a - ^;i)'ntJ • • • • • n;;,-(y;^' - y;^)) + \(.^c,^gn" ^'^-^ 

He, 



which can then be bounded using the induction hypothesis. This concludes the induction step, 
provided C > Cq + 16a/ Cq, and proves ((18)) . 

We now prove ((19)) . Use the Cauchy-Schwarz inequality to bound 

T V' -rr«i rr V' v«g" \ I 



E I ^'^PG^^qc, '^qG)^^qf ' ' ' njjj • • • ^yi'^qc ) \ 

< ( E ^^pMqf - ■ • ■ n?; - ■ ■ n;j;(y;r - %)) 

1/2 



PG I I ^ ?G / ^ ^ V ^ ^(Jl ^ ^ 9G 

«G" 

by (dD). We obtain ([19]) by noting that 

E Tr,, ((y;f ')^y;r - y;n;j; . . . n;; . . . nj;;y,,) = 



"G" 



since the Ylq'. sum to identity over a,. □ 
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4.4 Bounding the success of players in a repeated game 

We proceed to show how the results from the two previous sections can be combined in order to 
prove Theorems [7| and |9l For the remainder of this section we fix a game G with question set Q 
and answer set A, and consider the i'-repeated games Gp^£) and Gq^i^ for some fixed integer £. 
Let s be the entangled value of the original game G, and {A"^,}^' (resp. {{B'^)^}a) be an arbitrary 
fixed projective strategy for Alice (resp. Bob), using entangled state in the ^-repeated gamel^ 
Let p = Tya I (Y| be the reduced density of | Y) on Bob's subsystem. 

We note here that both types of ^-repeated games have the same overall structure, in that they 
consist of a set of Ci "correlated" rounds, in which the referee sends either "game" or "consis- 
tency" questions, and C2 "independent" rounds, in which he asks questions chosen independently 
from a product distribution (we refer to Definitions S] and |5] for more details, including the defini- 
tion of Ci and C2). In both cases, we can think of the referee as choosing the t pairs of questions in 
the following order. 

1. First, a subset R C [i\ of size r* < Ci/2 is chosen, and designated as indices for either 
game rounds (in the case of a projection game), or otherwise consistency rounds. Pairs of 
questions {q'^, qR) are then picked according to the appropriate distribution. 

2. A subset G C [l\\R of size Ci — r* is chosen. In the case of a projection game, all the 
indices in G are designated as game rounds. In the other cases, Ci/2 of the indices in G 
are designated (at random) as game rounds, and the remaining indices are designated as 
consistency rounds. Pairs of questions (Jg) are chosen accordingly. Note that the referee 
doesn't know the value of r* , but he doesn't need to explicitly distinguish between the game 
and consistency rounds, since they use the same distribution on pairs of questions. The 
distinction is made only as a convenience for the analysis. 

3. Finally, we let F = [£]\(R U G). F has size C2, and the indices it contains are designated as 
confuse rounds, with corresponding pairs of questions {q'^, qp). 

We will denote by {q',q) := {q'^qc^'p, qRqc^F) the £-tuple of pairs of questions chosen by the 
referee. Since questions on the indices in R always correspond to cases where for every answer of 
Alice there is a unique possible valid answer for Bob, and since we will only perform consistency 
(as opposed to game) checks on questions in those indices, we may regroup Alice's tuples of 

answers a'^ when they induce the same ar for Bob. Hence we re-define Ajjj^J^ := K^^q^ where 
the summation runs over all fl'^^ such that (fl^, Ar) are valid answers to the questions {q^^, qR). 

Our first claim shows that the players have a low success probability on blocks (R, q^) which 
are dead. 

Claim 18. Let e > Obe such that e > CiCj"^, and suppose that (R, qR) is an e-dead block. Then the success 
probability of the players, conditioned on the referee picking questions {q', q) such that q includes qR in the 
positions in R, is at most ^fle. 

Proof. The definition of (R, qR) being £-dead implies that 

Or 

^^The transpose sign on Bob's operators is there for consistency of notation. For simplicity we will omit this transpose 
in the future whenever we consider expressions of the form ( Y | A ® B | ¥) , which should be read as ( Y | A ® B ^ | Y) . 
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By applying Claim |26] to the Bg'^g together with Markov's inequality, we obtain that in expectation 



^G,qG 



LTr{BZ,cP'''K,cp''')] < £ + Ciq-i < 2e (21) 



where we used |G| < Ci and our assumption on e. Condition on {q'j^, q^) being chosen as part of 
the referee's questions in the game, and assume that the referee only checks consistency of Alice 
and Bob's answers to the questions in R. This can only increase their success probability, which 
can then be bounded as 



E(G,F),(ij'gq^,<jG9F) 



< Vie 



where we used that ((?f,(?F) are chosen according to a product distribution, the first inequality 
follows from Cauchy-Schwarz (recall the definition of || ■ \\p given in (|l])), and for the second we 
upper-bounded YlaR Wp W ^ used Jensen's inequality together with ((2T]| to bound the 

other term. □ 

We note informally that one can combine this claim with Lemma [171 to obtain a form of "direct 
product test" for entangled strategies. Indeed, if two entangled players Alice and Bob win the 
game with probability s » e, then by the previous claim a fraction at least s^/2 of blocks {R, (Jr) 
should be alive; moreover a non-negligible fractior^l of answers to those blocks must be (1 — 
?/)-serial. Hence one can apply Lemma [171 to those blocks {R,q^,a^) and obtain a product form 
for the corresponding marginalized strategy. 

The next claim shows that strategies which are product, even on a subset of the coordinates, 
also have a low success probability. 

Claim 19. Fix {R,q^,a^), and for every {i,qi), where i G [^]\R and qi G Q, let {IVg}a€A a fixed 
projective measurement. Suppose that Bob's strategy is such that, with probability at least 1 — S over the 
choice of (G, qc) and Gi C G of size \Gi\ = g, there is a partition Gi = G' U G" such that g" = \G"\ > 
(1 — 5')g and Bob's POVM satisfies that for every ac 

'^qRqc — \°qR) 

where for simplicity we wrote G" = {1, . . ■ ,g"}. 

Then the success probability of the players, conditioned on the referee asking questions {q', q) such that 
q includes q^ in the positions in R, and summed over all valid answers which include ar for Bob, is at most 



{S + e-^'-^-''^'s)Tr{B"g^^p) 



i^Note that one cannot hope to obtain any structural result on the strategies which would hold for more than a frac- 
tion s of questions or answers, as the player's strategy could be a mixture of a perfect winning strategy with probability 
s, and a random strategy with probability (1 — s). 
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Proof. Fixing the questions in R and G, and conditioning on the players consistently answering 
to {q']^, qR), their probability of being accepted is at most 

= ((Y|w^(B^^n.A;f;'r^nV...n^j...nV 



(22) 



The fact that sequential strategies cannot succeed in many rounds of the repeated game implies 
that 

|%ygwg)[ e ^S^'><?---nr--nj;;]|L < eM-i^-s-5'fg) 

Indeed, the expression on the left-hand side can be upper-bounded by the maximum success prob- 
ability of an Alice playing an arbitrary strategy and Bob a sequential strategy described by the 
measurements Ylq\, provided the referee only checks the answers to those questions in G" C d, 
where Gi is a random subset of G of size g chosen by the referee. But this success probability is 
even lower than the success probability that Alice and Bob would have if Bob played his sequen- 
tial strategy on all questions in Gi, but the referee was to accept as long as at least g" out of Alice 
and Bob's g answers were correct. Since the probability of such a serial strategy succeeding in 
any round is at most the value s of the original game, and g" > (1 — ^')^, by a Chernoff bound 
the probability that the players succeed in g" out of the g rounds is at most exp( — (1 — s — S')^g). 
Hence the expression in ((22)) can be upper-bounded, in expectation, by 

Finally, we must account for the small probability S that the serial property does not hold; for 
those sets G we can trivially bound the success probability, conditioned on Bob answering ur to 
by Tr(Bg». □ 

We finally turn to the proof of our main theorem. 

Proof of Theorem [71 We first set parameters: let Co be a large enough constant, e = Cq^3^ (recall that 
3 is the target value for the repeated game Gfk{^)), V = Cq^5^'^'^^{1 — s) (where C2 is the constant 
which appears in Claim [T6l>, g = Colog(l/c5)(l — s)^^, and I > Cl^S^^^^'^^{l — s)^^. Recall also 
that Ci was defined as Ci = and C2 = ^ — Ci . This choice of parameters satisfies the following 
constraints: 

• t] £^ > 16 C^^^^, which is used in Lemma [121 

• f/ > Cj^^^'^, which is used in Fact ll4l and subsequent claims. 

• £ > Ci Cj"^, which is used in Claim [iSl 

As before, in game Gpj^(^), we can think of the referee as first picking r* < Ci/2 pairs of 
questions (R, (tj^, qR)) for the players, then picking g pairs (Gi, [q^Q^, qci ) )/ then Ci — r* — g pairs 
(G2, {q'g , qG2)) ^rid finally C2 independent pairs of confuse questions (f , {q'p, qp))- Let G = Gi U G2 
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and {q',q) = ('7r'?g'?F' '?K'?G'?f )• Let {AJ^I},?' be Alice's POVM on questions q', and {B^}a Bob's 
POVM on questions q. 

By Lemma [121 one of two cases hold. Either a (1 — £) fraction of blocks (K, j^r) are e-dead, in 
which case the player 's success probability is readily bounded by e + Vic by Claim[l8l Otherwise, 
it must be that we are in case 2 of the lemma, so that e-alive blocks are for the most part serial. 
Note that any dead blocks contribute at most \/2£ to the success probability, by Claim[l8l A similar 
argument to that in Claim [181 shows that alive blocks which are not (1 — -serial also contribute 
at most \/2£, given the fact that we are in the case 2. of Lemma [121 and there can only be few such 
blocks by (|5|). 

Suppose {R,qR,aR) is (1 — /y)-serial. By Lemma [171 for every {i,qi) there exists a projec- 
tive measurement {nj^jla,, depending only on qR,aR,qi,ai, such that with probability at least 
(1 — 2?;^'' ^"^2 _ g-^g-j i^^gj. |.]^g choice of (G, qc) such that |G| = g there is a partition Gi = G' U G" 
such that g" = \G"\ > (1 — ^rj'^^'^)g such that Eqs. ((T8|) and ([T9)l from Lemma W7\ are satisfied, 
where pc = p^^^Bq^^q^p^^^. To alleviate notation we let n = Il"q\ ■ ■ ■ ^q^^„, and we first use Cauchy- 
Schwarz to bound 



'^Ql/y^C" 



[BZ;r)\id-n'n)B;Tqr\V 



UKT,r)\id-n'n)BZ;f 

"nil 



< W^lqjp (Y,Tr,MZf)\ld-Il'U)B;%f 



1/2 



I'rHg 



(23) 



where Pq = p^''^Bq^^g|0^''^, the first inequality is by Cauchy-Schwarz, the second uses {Id — 
n^n) < Id, the last is by Eq. ((T9|) from Lemma [171 and was defined in Eq. ((TO)) (where here we 
substitute B^^ for Xg^). A similar argument, using this time Eq. ((T8|) , lets us bound 



i—i ^ I q'q' 



( AaRUQi, 

y"qRqG 



and hence combining ( [231) and ([24l ) we get 



L 



v^'Jr'Jg 



I 



Finally, by Claim [26l we have 



HG,qG) 



E 



^ ' ^rIg 



<4||A"f. \\,E, 



^q'^qG^^P '='(G,qG) 



|g«R IK 

I Wg lip 



'JR'JG^ 

1/2 



< 4?/||A 



„l/2 



q'^qG^^P'^^R 
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where for the first inequality we used Y^a'^. n^n = Id, and for the second that > C2^. Hence 
the statistical distribution of outcomes produced by Alice and Bob (conditioned on answering ar 
to qR) is close to that which would be obtained if Bob was to use the operators {Bq'^)'^YI^TlBq'^ as 
his POVM on questions qc- ^^ut the success probability of the latter, when summed over all valid 
answers to the pair of questions {q'Q„, qc")/ can be bounded by Claim [I9l Hence summing over 
aU Ur (and using YlaR ll^^^'^^g \\p "^li^ — ^) taking the expectation over qR, the average winning 
probability of the players for all (1 — f/) -serial blocks {R, qR, ur) is at most 

where we also accounted for those (rare) choices of (G, q'^, qc) for which the previous bounds do 
not hold. Given our choice of parameters e, tj, g and £, it can be checked that this expression is <^ 5. 
Combining this bound with the one resulting from dead blocks shows that the winning probability 
of the players is at most 5, which proves the theorem as long as £ = poly ((5^^, (1 — s)^^) is large 
enough. □ 

We conclude this section by briefly explaining how the proof of Theorem [7| can be adapted to 
prove Theorem |9] The main reason the proof carries over is that, in the proof of Theorem [71 we 
only used the projection property for a subset of the game questions (to bound the success over 
dead blocks), while for (1 — ?/) -serial blocks the game questions were only used in conjunction 
with the fact that the value of the game was at most s. Here, consistency rounds will play the role 
of the game questions previously in R, and game rounds will play the role of those game questions 
previously in G (or rather its small subset Gi). 

Proof of Theorem^ In game GoRjg), we think of the referee as first picking r* < Ci/2 pairs of 
consistency questions {R, {qR,qR)) for the players, then picking Ci/2 — r* additional consistency 
pairs {R' , {q'Ri,qRi)), Ci/2 pairs of game questions (G, ((?g,'?g)) ^rid finally C2 independent pairs 
of confuse questions (f, {q'p,qF))- Let {q',q) = ((?r(?r,'?g'?F' '?R'?R"?g'?f)- 

Assume a choice of parameters made that is similar to the one in the proof of Theorem [71 As 
before, we can apply Lemma [121 to Bob's strategy B^, distinguishing between two cases. 

In the first case, a fraction (1 — e) of blocks (R, qR) are dead, for \R\ = r* . Then Claim [l8l again 
applies, as the only property we used in its proof was that any answer of Alice induced a fixed 
answer for Bob, which is the case for consistency questions. 

In the second case, a fraction e of blocks {R,qR) are alive. Those blocks which are dead can be 
dealt with as in the previous case, and we can focus on blocks (R, qR, ar) which are (1 — //)-serial. 
Here we can reason exactly as in Theorem [7[ using Claim [19] with Gi chosen as a subset of the 
questions in G, and the remaining consistency questions playing the role of the remaining game 
questions before. □ 

5 Approximate block-diagonalization of almost-orthogonal operators 

In this section we prove our orthogonalization lemma. Lemma [23] below, which shows that pair- 
wise almost-orthogonal operators are close to having a joint block-diagonal decomposition. The 
main ingredient in its proof is a robust orthogonalization lemma for families of pairwise almost- 
orthogonal projectors. Lemma [21] 
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Theproof of Lemma I2l1 is based on a variant of Schoneman's solution to the "orthogonal Pro- 
crustea^ 1 problem". Given any square matrices A and B, this is the problem of finding the orthog- 
onal matrix Q which minimizes 

Q := argmin ||A — Bn||p 

where ||X||| = £f^^Tr(X^X) is the normalized Frobenius norm. Schoneman llSch66ll showed that 
the optimal Q is O = ITV'^ , where UEF^ is the singular value decomposition of B^aII^ Indeed, 
given unit vectors \u\) , . . . ,\v]^ , one can let A be the matrix with columns the |m,), and B the 
identity. In this case, the orthogonal Procruste's problem consists in finding the best rigid rotation 
which maps the canonical basis of space to the vectors \vi) , where the error is measured in the 
least squares sense — the columns of the corresponding orthogonal matrix will then form an 
orthonormal family close to the 

We carry out this solution precisely in Claim |20] below, which, even though we will not use 
it directly, contains all the intuition necessary to solve our original problem on positive matrices. 
Unfortunately, the solution to the latter is made more involved technically by the the matrices not 
being of rank 1, and the slightly unorthodox (and, in particular, not rotationally invariant) way in 
which we measure the error. 

Claim 20. Let \u\),. . . , G C*^ he unit vectors such that ^ Y^i^j{ui, uj)^ < e. Then there exist orthog- 
onal unit vectors \vi), . . . , \vf^) G C*"' such that \ YLi || I";) ~ 1^;) ||^ ^ 

Proof. Let X be the kx k matrix whose columns are made of the vectors |m,), expressed in the 
canonical basis. The SVD of X is X = illLV^ , where II, V are unitary and E is diagonal with the 
singular values s, of M on the the diagonal. Then 

\t.{l-s}f = ||E+L-7d||| = \\X'X-ld\\l = ^El(«''"/)|' < ^ (25) 

where for the first equality we used the unitary invariance of the Frobenius norm, and the second 
is by definition of X and uses the fact that the |m,) have unit norm. Let Y = UV"^ . Y is a unitary 
matrix so its column vectors \ vi) form an orthonormal family. We have 

\ E II i«o - \\\ = u- y\\l = \\id-nl = \ Ea - s^f 
(=1 1=1 

which can be bounded by ^ since (1 - S/)^ < (1 - s,)^(l + s,)^ = (1 - s}f- □ 

We now extend this claim to the case of almost-orthogonal projections, which need not have 
rank 1, and to a slightly different way of measuring the error (most of the difficulty in proving the 
lemma comes from the different norm rather than from the higher rank). In order to understand 
the following, it may be helpful to first consider the case where p,- = [dk)^^ld for every i. 

According to Wikipedia, Procrustes, or "the stretcher", a figure from Greek mythology, was a rogue smith and 
bandit from Attica who physically attacked people, stretching them, or cutting off their legs so as to make them fit an 
iron bed's size. 

^^We are grateful to the user "ohai" of MathOverflow.net for pointing out the cormection between this problem and 
that of the robust orthonormalization of almost-orthogonal vectors. 
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Lemma 21. Let pi, i = l,...,kbe positive matrices, and p := J^j pj. Let Pi, . . . , Pj. be d-dimensional 
projectors such that 

Tr{PiPjPi pi)<e and X] ^K?/ Pj) < ^ 

for some < e < Tr{p). Then there exists orthogonal projectors Qi, . . . ,Qk such that 

i:Tr{{P,-Q,fp,)=0{e'^')Tr{pr' 

Proof. For every i write P, = |X;/)(x,||, where the are orthonormal, and let X, := 

Hi X '■— Hi^i' where |e,/) is the canonical basis: X has the as its columns. In 

order for X to be a square matrix, if necessary we extend the space in which the |x, ;) vectors live, 
so as to make it the same dimension as Span{ je, / ) }. The inner-product condition on the P; implies 
that 

YTr{PiPjPiPi) = ^ J^{xij\xjj,){xj^i,\xi^i„){xi^i„\pi\xi^i) < £ (26) 
i^j t^j 1,1', I" 

Write X"'"X = l^;,/') so that 

X]Tr((X+X-W)2x,V,X;) = X] E {Hi\Hi'){Hi'\Hi"){Hi"\Pi\Hi) < ^ (27) 

[j,v)Hhi)Ai,i") 

where we used (|26)l to upper-bound the expression in the middle by e. Indeed, in the second 
summation, if i = j then either V ^ I or V ^ I", so that one of the inner products or 
(x/ //|x, /w) is 0, since the {|^!,;)}/ are orthogonal. 

Let X = WLV^, where S is diagonal positive and U, V unitary, be the polar decomposition 
of X. By an appropriate choice of the basis we can assume that V = Id (if not, re-define 
Xj := XiV; this corresponds to changing |e,/) — > V^\ej^i)). Let Yl be the projector on the span of 
the eigenvectors of S with corresponding eigenvalue at least 1/2 and at most 2. n is needed to 
control eigenvalues of Z which may be too small or too large. 

Let U = Un and X = Xn. Let jj7j /) (resp. |xj,;)) be the column vectors of U (resp. X), so that 
U = We will show that the projectors Q, := Yli ^re close to the projectors 

Pi, in the sense claimed in the lemma (note that since U is unitary and LI a diagonal projector 
the Qi are orthogonal projectors, which do not necessarily sum to identity). We first state some 
consequences of ((27)) . 

Fact 22. The following inequalities holds 

Yj i^U ~ I - Hi' ) (Hv \pi I Hi ) < £ (28) 
i,i,i' 

Ym,i\p\ili,i) - {Hi\p\Hi)\ < 2V2e'/'Tr{py/' (29) 

Proof. We start with proving (|28)) . Since Z is diagonal, one can immediately check that X^X — Id = 
(X- JJ)+(X+ (J). Note also that (X + !J)(X + U)+ = !J(W + E)2(J+ > Id. Hence 

X]Tr((E - IdfxjpiX,) =YTr{{X - LJ)^X - U)Xjp,Xi) 
i i 

< YTr{{X-Uy{X + U){X+Uy{X-U)XjpiXi) 

i 

< e (30) 
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where the last inequality is by This implies that 1>((E - Jd)2(X,n)+p, (X,n) < e (note that 
n commutes with Z by definition), which is just ((28)) . 
Before turning to the proof of ((29l) , first observe that 

< 2e (31) 

where the equality uses that (E — Id)'^YI) is diagonal, and the inequality is by (|28| for the terms 
i = i and uses (E — W)^!! < Id and the second condition in the lemma for the terms i 7^ /. 
From ((3T]l we get 

- - %) = Tr(n(X - LJ)V(X - U)) 

< 4 Tr (EnE(X- LJ)V(X- fi)) 
= 4 Tr ((W - E)n(;d - E)xVx) 

< 8e (32) 

where the first inequality uses HE > 1/211, by definition of 11, and the last is by (|3T]) . 
We now prove ((29)) . By Cauchy-Schwarz, for every (z, /) 

{Ui,l - < {Ui,l - XQ\p\Ui,l - Xi^i)^'^{Ui^i\p\u^Y^'^ 

hence by (|32)) we see that 

i,l 

A symmetric inequality can be obtained, and ((29)) follows by the triangle inequality. □ 
As a consequence of Fact|22l note that 

1 /2 1 /2 

V) {^i,i'\pt\Hi - ^u) < ( J2{Hi\Hi'){^i,i'\pMt,i)) (E("'.' ~ Hi\pMi,i - 

< Tr{pf'^ ■ (8£)i/2 = 0(£i/2)Tr(p)i/2 (33) 

where the first inequality is by Cauchy-Schwarz (and the | M,y ) being orthonormal) and the second 
uses XiX\ < Id, and (|32]) (with pi < p). 

In order to bound the distance between Q, = YLi 1 ^rid P„ we first bound the distance 

between Q, and P, := X,X/: 

Y,Tr{{Pi - Qifpi) = Yj {{hi\Pi\hi) + -21] K((m;;|%) (x,y/|p/|l7,y)) 

< 2X](%|p,|x,,) -2X:3?((%/|V) (%|p,|x,,)) +0(£i/2T,(p)i/2) 

< 0{£^'^Tr{pf'^) (34) 
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where the first inequality is by ((29)) and ((33)) and the second by ((28)). It remains to bound the 
distance between the P, and the P, : 

i i 

<2Y^Tr{\Id-^xlpiXi) 

i 

<l{Y.Tr{{ld- Y.fx\p,X,) Tr{xjp,X,) ) 

! i 

< le'^^Trip)'/^ (35) 

where the first inequality uses [Id — Yl) < 2|E — Jd| by definition of H, the second is Cauchy- 
Schwarz and the last is by ((30)l . Combining ((34)) and ((35]) finishes the proof of the lemma. □ 

Lemma[2T)lets us prove the orthogonalization lemma below. In that lemma one can think of the 
Yj as operators in the Stinespring representation of a measurement : p i— > y, (p (8) Id)Y^, where 
i refers to the z-th outcome of the measurement. In that setting the hypothesis of the lemma is that, 
when A4 is performed twice sequentially on a specific state p, it is likely that identical answers 
will be obtained. The conclusion is that the operators Y, have an approximate joint block-diagonal 
form, as described by the orthogonal projectors IT,. 

Lemma 23. [Orthogonalization Lemma] There isac > such that the following holds. Let pi, i = 1, . . . ,k 
be positive, p such that Y^j pi < p and Yi, i = 1, . . . ,k (possibly rectangular) matrices, be such that 

Y^Trp^{Yl{YjYl)t)<^Tr{p) (36) 

and YiYj < Id. Then there exists orthogonal projectors {Tlj} such that 

Y2Trp,{Yl{Id - n,)y,) < 0{a')Tr{p) 

i 

Proof. The idea of the proof is simple. Let j6i, > be parameters to be chosen later. For every 
i, let Pi be the projector on the eigenvectors of % Yj with corresponding eigenvalue at least jSj. 
Since P, contains all the large eigenvalues, P, Yj ~ Y,. Moreover, by definition P, < /3^^ Y, Y^. These 
two properties together with ((36)) almost imply that Y^i^j Trp. {YfPi Pj PjY,) < fi^^aTr{p). Choosing 
/3i y/a, we could then apply Lemma [IT] to the Pi and states cr, := YipiYj , recovering close 
orthogonal projectors O, which would satisfy the required condition. Carrying out this intuition 
precisely is a bit tedious, and we now proceed to the details. We will use the following simple fact. 

Fact 24. Let A > 0, p > 0, and H a projection. Let 

a = Trp{A), b ^ \Tr p{{ld - U) AU)\ and c = Tr p[{ld - U) A{Id - U)) 
Then both the folloiving hold 

Trp{nAn) < {^+^f <2{a + c) 
TrpiUAYl) < [^'^f^y<a + 2b 
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Proof. Write n = (TL - Id) + Id, so Trp(nAn) < \Trp{{n - Id)An)\ + \Trp{An)\. The second 
term can be bounded by a^^^Trp{YlATiy^^ by Cauchy-Schwarz. Similarly bounding the first term 
by c^^^Trp{rLAri)^^^ yields the first equation. To get the second, let X = rrp(nAn)^^^ to obtain 
the equation 

Solving and using X > 0, one finds that this is equivalent to X < (a/a + \^ a + 4b)/2. □ 

Let y_, := Y^j^i yjyj < id, and Q, be the projector on the eigenvectors of PiY^jPi with eigen- 
value at most /32. Note that, by definition, Qj < Pj < fi^^YiYf (and in particular Qi commutes 
with Pi). We first bound the distance between Y/" and Y/Q,-: since Y/(7d - Qi) = Yl{Id - Pi) + 
YlPi{Id-Qi)Pi, 

X]rrp,(Y/(M-QOY,) =X](rr,,(Y+(7d-P0Y,) +Trp,(Y/P,(Jd-Q (37) 

i i 

The first term is easily bounded by j6i Tr{p). For the second, note that Pi{Id — Qi)Pi < f^2^PiY_iPi. 
UsingFactlHwith A' = Y_,-,n' = P,-,andp' = Y,p;(Y,)+ we get£; fl' < ixTr{p) and^iC' < ^iTr{p), 
so that 

^ Trp, {YlP^Y^AYi) < 2{a + ^^)Tr{p) 

i 

Assuming a < j6i (which will hold for our choice of parameters), from ((37|) we get 

Y,TrpXYhld-Qi)%) < 0{^2'^i)Tr{p) (38) 

Next observe that, by definition of Q„ followed by an application of the Cauchy-Schwarz 
inequality, 

X] I Trp, {YtQiY-i{ld - Q,)%) | = ^ | Tr,, (Y+Q, Y_,( W - P,)^/) | 

i i 

< ( E ^'P. iVild - P,)%) Tvp, {VQrY^_,Qi%) ) 

i i 

< ^\'^^2Tr{p) (39) 

where we used Q, Y^ -Q, < ^\ld, which holds by definition of Q,, to bound the second term in the 
last inequality. Using the second bound in Fact l24l with A' = Y_;, TV = Qj, p' = Y,p;(Y,)^, we get 
a' < aTr{p) and Zi b' < ^Y^h'^^ip) W ISH), so that 

E^rp^i^lQiQjQiri) < fi^'J^Trp^iYlQ.Y^iQiYi) 
</3-i(« + 2/3|/2/32)rr(p) 

Set jS2 = i^i^^ and = c&'^ to obtain 

X] Tr,, {StQiQ]QiY^ < 0(«i/5) Trip) (40) 
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Let (7i := YipiYj . We are now ready to apply Lemma |2T] to the Q; and cr,: the first condition holds 
by (|40)) , and the second is a direct consequence of (|36)) and Qj < fi^^YjYj' for every The lemma 
then gives us pairwise orthogonal H, such that 

i 

Combined with ( [38l > and the triangle inequality, this leads to 

rr,,(y/ ad- not,) < 0(^1/10) Tr(p) 

i 

□ 

6 Discussion and open questions 

Our work shows for the first time that the entangled value of games can be decreased through 
parallel repetition. Even though we framed and proved our results in the context of 2-player 
games, it should not be hard to extend them in some cases to multiple players, depending on 
the kind of projection or consistency constraints that one can assume on the game. On the other 
hand, extending the result to either many-round games, or games with quantum messages, is an 
interesting open question. 

One implication of our result is the following. The celebrated PCP theorem says that given a 
game, it is NP-hard to tell if its value is 1 or less than, say, 0.99. Combined with Raz's parallel rep- 
etition result, one obtains that it is also hard to tell if the value is 1 or less than, say, 0.01. The latter 
statement led to an enormous body of work on strong hardness of approximation results BHasOlL 
It is currently a major open question whether an analogue of the PCP theorem holds for the en- 
tangled value. If such a result was proved, our results would allow to amplify the hardness to 1 
vs. 0.01, as in the classical case, possibly leading to further surprising implications. 

The main open question left by our work is whether it is possible to show a better rate of decay, 
in particular an exponential rate as Raz obtained from direct parallel repetition, or IIIKW09II first 
obtained in the setting of direct product testers. Another open question is whether our statement 
can be extended to hold for simple parallel repetition for arbitrary entangled games (i.e. without 
adding dummy or consistency questions). 

We believe that our main conceptual contributions are the extension of the notion of "approx- 
imately serial" to the setting of measurements, and our subsequent orthogonalization lemma. We 
hope that these techniques might prove useful elsewhere, perhaps in establishing hardness of 
entangled games. Lastly, product testers are very useful in the area of property testing, and it 
remains to be seen if our result can be applied similarly. 
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A Some useful technical facts 

In this section we prove a series of useful claims showing that, in a strategy which has been 
marginalized over a large number of indices, fixing a particular coordinate (z, (J;) does not have 
much influence on average. Throughout this question we fix a question set Q and a distribution 
}i on Q. Whenever an expectation over tuples of questions q G Q'' is taken, it will be over the 
product distribution pi'-. 

Our claims will rely essentially on the following, which applies to any matrix semi-norm || • ||, 
provided it is derived from a semi-inner product (■,-). 
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Claim 25. Let C be an integer, and / : Q'^ ^ { X G C'^^'^ }. Let M = Eq [f{q)] and for any {i,qi), 
Mi,q, = Eq^, [f{q)]. Suppose that [\\f{q)f] < 1. Then 

1. 0<E,,,[||M-M,,,|p] <^^^^<1 

2. E,,,, [\\M-M,J^] = E,,, [\\M,J^] - ||Mf . 

3. Pr,g, (I Tr(M) - rr(M,-,,,)| > C-i/3) < c-^/\ 



Proof. The proof of all three parts is in close analogy to that of Lemma 2.1 in IIO'DOSaL which 
shows similar statements for a Boolean function /. For part 1 note that E, q. [IjM — M,;qjp] = 
h EF=i [I|M - H-qJP] and hence it suffices to show that Ep^^ Eg, [||M - M,-,g,f ] < Tr(M). Ob- 
serve that 



< E„ 



\\f{cj)-£{M,,,-M)\ 



''7 



= £jll/('?)f]-E£^.[l|M-H-,,,f], 



where for the last equality we have used that Eg, [M,;^. — M] = and hence Eg. [(M,;g. — M, M,;g.)] 
[(M;,g,. - M,M,;g,. - M)] and, for i ^ j, 



(M - Mi,y, M - Mj,j] = (Eg, [M - M,-,g,] , E, 



M-M 



)=0 



Part 1. now follows, and the second inequality is simply the assumption that E^[||/((j)|p] <1. 

Part 2 is trivial from the expansion of ||M — M;^g. |p. Part 3 follows from part 1 using Markov's 
inequality, which gives Pr,;g.((rr(M-M,;g.))2 > C^^/S) < C^/^E^.^. [( Tr (M - M;,g,.))2] . Observing 
that for A := M- Mi^qi we have {Tr{A)f = {A,Id)^ < \\A\\^ ■ \\Id\\^ = \\A\\^ gives the desired 
bound. □ 

The following is a direct corollary of Claim |25l obtained for a specific instantiation of the norm 

Claim 26. Let Yq,for q ^ Q'^ and a G A*-, he positive matrices such that Yg '■= J^a^q ^ ^d, and p > 0. 
Let Y = Ec, [Yg] . Then 



^{i,qi) 



Tr{Y p^'^Y p"^) - Tr{Yq^p^'%^p"^) 



l/2v ^1/2A 



Tr{YcfP^'%p^'^) < Trp{Y) 



Proof. The statement follows from Claim |25l applied to f[q) = Y^ and the (semi)-norm = 
Tr(Ap^''-^A"'"|0^''^), whichis derivedfrom the inner-product (A, B) i— > Tri^Ap^^-^B'^ p^^'^Y Thesecond 
inequality holds since < Yg < 7d for every q. □ 

We now give two simple calculations which will be useful. The first is a well-known operator 
version of the Cauchy-Schwarz inequality. 
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Claim 27. Let A, B be (possibly rectangular) matrices such that A'^B exists, and B^B is invertible. Then 

{A^B){B^B)-\B^A) < A^A 

Proof. Let A = {B^By^{B^A). Then the matrix (A - BA)"^(A - BA) is positive, which gives the 
result. □ 

Claim 28. Let G C^''^ < Yg < Id Jar q G QC and let Y = [Y^], Y,-,,, = E,^, [Y,] for i e [C]. 
Then 



Proof Write 



< (y, - - Y)) (y, - - Y) 



Y^ 



- L -y) + in,. - y)y,) + E (n,, - y) ( v - ^) 



Taking the expectation over q, we obtain 



Dividing by C on both sides proves the claim. 



□ 



Claim 29. For every q E Q'~ let {X^l^eAC ^ POVM, and := TT{q)^X^ (g) (ij, fl| (as described in 
SectionEB, and p>0. Assume that XX^ = E« Eg ^qi^q)^ < Then 



Proof LetXf = |X''(X'')+ - X^,(X^,)^|, and = |XV(X'')+ - X^_|0(X^_)+|, where the notation 
keeps the dependence on qi implicit. Use the triangle inequality to write 

|Tr(X"(X«)+XV(X")+) - rr(X^",(X;,)'X,V(X«,)+) | < rr(X«XV(X")^) + Tr(x;,(X^",)V~n (41) 

The expectation of the first term on the right-hand side of (|4T]) can be bounded by Cauchy-Schwarz 
as 



Tr(XfXV(X'') 



< E, 



Trp{{X''yX''Y'^Tr{{XlfX''p{X'') 



< C-^^^Trpi^iX^yX") 



by Claim |25l applied to the (semi)-norm \\Af := Tr{{A^A) {X^p{X''f)) and the mapping / : q ^ 
X?(X,")+. 

Regarding the second term on the right-hand side of (|4T]| , let A be the block-column matrix 
with blocks a/ TT{qi)p" for every {i,qi) and a, and B with blocks y7r(^Xf (Xf )+. Then B+B = 
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Ea E(i,,,) [(X;'(Xf )+)'] < Id. Let D = A+B = Ea [p^Xf(Xf)+]; the operator Cauchy-Schwarz 
inequality from Claim |27| gives 



Applying Claim |28l to X^p(X^)"'" (for every a), we can then bound 

X^p^Xj 



(XqpX})2 



(42) 



where for the second inequality we usedX+X, < Id. Since Tr(D) < Tr{VDD^) = ||D||i, taking 
the square root on both sides of (|42)) (the square root being operator monotone) and then the trace, 
we obtain 

^ < C-^'^Tr,[E„ 



^£(,,,.) Tr(p-X?(Xf)^) 



C 



-1/21 



where X is the rectangular matrix with square blocks Tz{q) ^^'^X'^ arranged in a column. By 

Holder's inequality ||Xjo||^ < Tr(|0) ||X||oo/ and ||X||oo < 1 since X"'"X = 
finishes the proof of the claim. 



XqX^ 



< Id. This 
□ 
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